Abstract. Rao, Pathak and Koltchinskii (1997) have recently studied a sequential approach to resampling in which resampling is carried out sequentially one-by-one (with replacement each time) until the bootstrap sample contains $m \approx (1 - e^{-1})n \approx 0.632n$ distinct observations from the original sample. They have established that the main empirical characteristics of the sequential bootstrap go through, in the sense of being within a distance of order $O(n^{-3/4})$ from those of the usual bootstrap. However, the theoretical justification of the second order correctness of the sequential bootstrap is somewhat involved. It is the main topic of this investigation. Among other things, we accomplish it by approximating our sequential scheme by a resampling scheme based on the Poisson distribution with mean $\mu = 1$ and censored at $X = 0$.


1. Introduction

Efron (1979) introduced the bootstrap method of resampling as a widely applicable technique for estimating the variance of an estimator and the sampling distribution of a given statistic. In a fundamental paper, Bhattacharya and Ghosh (1978) demonstrated that Edgeworth expansions for a wide class of statistics can be derived from Edgeworth expansions for multivariate sample means. This technique was used by Singh (1981) to show, in the case of the univariate sample mean, that the bootstrap is more accurate than the central limit theorem when higher order population moments exist. These ideas were further exploited by Babu and Singh (1983, 1984) to show the superiority of the bootstrap method and by Babu and Singh (1985) to obtain Edgeworth expansions for the ratio statistic and similar statistics based on samples from finite populations. The method was also used by Babu and Singh (1989) to obtain global Edgeworth expansions for functions of means of random vectors, when one of the coordinates has a lattice distribution and the remaining part of the vector has a strongly non-lattice distribution. Later Giné and Zinn (1990) showed that, in a certain weak sense, the bootstrap method is valid (consistent) if and only if the central limit theorem holds. In fact the central limit theorem furnishes accuracy of approximation of order $o(1)$, while if the third population moment exists, one can expect, in many commonly encountered populations, the accuracy of the bootstrap method to be of order $o(n^{-1/2})$, where $n$ denotes the sample size. Thus the bootstrap method has the potential of being second-order accurate, whereas the central limit approximation does not. This is one of the several reasons for the current interest in, and preference for, those methods of resampling that are second-order accurate, i.e., accurate of the order $o(n^{-1/2})$.

Stemming from Efron's observation that the information content of a bootstrap sample is based on approximately $(1 - e^{-1})100\% \approx 63\%$ of the original sample, Rao, Pathak and Koltchinskii (1997) have introduced a sequential resampling method in which sampling is carried out one-by-one (with replacement) until $(m + 1)$ distinct original observations appear, where $m$ denotes the largest integer not exceeding $(1 - e^{-1})n$. The last observation is discarded to ensure simplicity in technical details. It has been shown that the empirical characteristics of this sequential bootstrap are within a distance of order $O(n^{-3/4})$ from the usual bootstrap. The authors provide a heuristic argument in favor of their sampling scheme and establish the consistency of the sequential bootstrap; however, the question of second-order correctness was not addressed.

One of the main advantages of the sequential bootstrap over the classical fixed sample size bootstrap is its performance in estimating the variance of an estimator when the original data contain several identical values. This situation occurs when the sample is drawn from a population with a distribution that is not continuous. To be more specific, suppose $X_1, \dots, X_n$ are i.i.d. random variables satisfying $P(X_i = 0) = 0.3$. Then, with positive probability, about 30% of the data $X_i$ are equal to 0. With positive probability, some bootstrap resamples consist of $n$ zeros, leading to a zero estimate of variance. On the other hand the sequential bootstrap, by sampling until $m$ distinct labels $X_i$ are selected, guarantees a resample that contains elements other than 0. Hence the sequential bootstrap scheme has an edge over the classical bootstrap, especially when dealing with categorical data.
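The point of the preceding paragraph can be illustrated with a small calculation. This is only a sketch under stated assumptions: the values n = 10 with 3 zero observations are illustrative, the function names are hypothetical, and m is taken as $[n(1 - e^{-1})] + 1$, the convention used in Section 2.

```python
import math


def prob_all_zero_classical(n, zeros):
    """Probability that one classical bootstrap resample (n draws with
    replacement from n observations) picks only the `zeros` zero-valued ones."""
    return (zeros / n) ** n


def sequential_resample_can_be_all_zero(n, zeros):
    """A sequential resample holds m distinct original units, so it can be
    all zeros only if the data contain at least m zero-valued units.
    Assumption: m = floor(n * (1 - 1/e)) + 1, as in Section 2."""
    m = math.floor(n * (1 - math.exp(-1))) + 1
    return m <= zeros


if __name__ == "__main__":
    n, zeros = 10, 3  # illustrative values, not taken from the paper
    print(prob_all_zero_classical(n, zeros))
    print(sequential_resample_can_be_all_zero(n, zeros))
```

With about 30% zeros the classical probability is small but positive, while the sequential scheme excludes the degenerate resample whenever the number of zeros is below m.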

The main object of this paper is to examine the second order correctness of the sequential bootstrap. The theoretical justification of this is somewhat more difficult because of the dependence among the bootstrap sample units. At this time a rigorous Edgeworth expansion under this kind of dependence is unavailable in the literature. A cumbersome approach based on computation of cumulants, under the (unsubstantiated) assumption that a formal Edgeworth expansion is valid, may be given along the lines of the Hall-Mammen (1994) paper. This does not lead to a complete solution as the Edgeworth expansions are not known. Instead we first approximate the sequential bootstrap by another sequential resampling scheme based on the Poisson distribution. Under the new scheme the "independence" of sample units under resampling is preserved. A rigorous justification of the Edgeworth expansion can now be given more easily. This then provides a sound theoretical framework in which the second order correctness for the sequential bootstrap can be established. In this paper we concentrate on sample means of $k$-variate random vectors. The Edgeworth expansions for smooth functions of multivariate sample means follow from similar expansions for multivariate means as in Bhattacharya and Ghosh (1978).


2.  Sequential  Resampling  Schemes 

Let $S = (X_1, X_2, \dots, X_n)$ be a random sample from a distribution $F$, and $\theta(F)$ a parameter of interest. Let $F_n$ denote the empirical distribution function based on $S$, and suppose that $\theta(F_n)$ is to be used as an estimator of $\theta(F)$. Efron's bootstrap method approximates the sampling distribution of a standardized version of $\sqrt{n}(\theta(F_n) - \theta(F))$ by the resampling distribution of a corresponding statistic $\sqrt{n}(\theta(\hat F_n) - \theta(F_n))$ based on a bootstrap sample $\hat S_n$ in which the original $F$ has been replaced by the empirical distribution based on the original sample $S$, and $F_n$ of the




GUTTI  JOGESH  BABU,  P.  K.  PATHAK,  AND  C.  R.  RAO 


former statistic has been replaced by the empirical distribution $\hat F_n$ based on a bootstrap sample. In Efron's bootstrap resampling scheme, $\hat S_n = (\hat X_1, \hat X_2, \dots, \hat X_n)$ is a random sample of size $n$ drawn from $S$ by simple random sampling with replacement (SRSWR). In the Rao-Pathak-Koltchinskii (1997) sequential scheme, observations are drawn from $S$ sequentially by SRSWR until there are $(m + 1) = [n(1 - e^{-1})] + 2$ distinct original observations in the bootstrap sample; the last observation is discarded to ensure technical simplicity. Thus an observed bootstrap sample under the Rao-Pathak-Koltchinskii scheme admits the form:

$$\hat S_N = (\hat X_1, \hat X_2, \dots, \hat X_N) \tag{2.1}$$

in which $\hat X_1, \hat X_2, \dots, \hat X_N$ have $m \approx n(1 - e^{-1})$ distinct observations from $S$. The random sample size $N$ admits the following decomposition in terms of the independent random variables:

$$N = I_1 + I_2 + \cdots + I_m \tag{2.2}$$


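in which $m = [n(1 - e^{-1})] + 1$; $I_1 = 1$, and for each $k$, $2 \le k \le m$,

$$P(I_k = j) = \left(1 - \frac{k-1}{n}\right)\left(\frac{k-1}{n}\right)^{j-1}, \qquad j = 1, 2, \dots \tag{2.3}$$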
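The sequential scheme described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function name and the use of index draws in place of the observations themselves are assumptions, and m is taken as $[n(1 - e^{-1})] + 1$ as stated in the text.

```python
import math
import random


def sequential_bootstrap_indices(n, rng):
    """Draw indices 0..n-1 with replacement until the (m+1)-th distinct index
    appears, then discard that last draw. Returns (draws, m).

    Assumption: m = floor(n * (1 - 1/e)) + 1, following Section 2."""
    m = math.floor(n * (1 - math.exp(-1))) + 1
    if m + 1 > n:
        # With n < 3 the (m+1)-th distinct unit can never appear.
        raise ValueError("n must be at least 3 for this scheme")
    seen = set()
    draws = []
    while True:
        i = rng.randrange(n)
        if i not in seen and len(seen) == m:
            break  # this draw would be the (m+1)-th distinct unit: discard it
        seen.add(i)
        draws.append(i)
    return draws, m


if __name__ == "__main__":
    draws, m = sequential_bootstrap_indices(50, random.Random(0))
    print(m, len(draws), len(set(draws)))
```

The returned list has random length N and always contains exactly m distinct indices; a bootstrap sample is obtained by reading off the corresponding observations.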


Although  we  have  established  the  consistency  of  this  sampling  scheme,  a  rigorous 
proof  of  its  second  order  correctness  requires  an  Edgeworth  expansion  for  dependent 
random  variables;  such  an  expansion  is  unavailable  in  the  literature  at  the  present 
time.  An  alternative  approach  that  can  be  used  is  to  slightly  modify  the  preceding 
resampling  scheme  so  that  existing  techniques  on  Edgeworth  expansion,  such  as  those 
of  Babu  and  Bai  (1996),  Bai  and  Rao  (1992),  Babu  and  Singh  (1989)  and  others,  can 
be  employed.  A  modification  of  our  previous  resampling  scheme  that  allows  the 
second-order  correctness  to  go  through  easily  is  as  follows: 


Poisson  Resampling  Scheme: 

For the selection of a bootstrap sample with a given number $m$ of distinct units, under the Poisson Resampling Scheme (PRS), we provide a conceptual definition and a practical approach. Let us take a sample $\alpha_1, \dots, \alpha_n$ of $n$ independent observations from $P(1)$, i.e., the Poisson distribution with mean 1. If there are exactly $m$ nonzero values in the sample, we accept it and take

$$\hat S = \{(X_1, \alpha_1), (X_2, \alpha_2), \dots, (X_n, \alpha_n)\}, \tag{2.4}$$

i.e., with the observation $X_i$ repeated $\alpha_i$ times, as the bootstrap sample. If the number of nonzero values in $\alpha_1, \dots, \alpha_n$ is not exactly $m$, we reject the entire sample and draw another sample of size $n$. The bootstrap sample size $N$ of $\hat S$ as in (2.1) is a random variable

$$N = \alpha_1 + \cdots + \alpha_n. \tag{2.5}$$


A practical way of implementing this resampling scheme is to first assign at random $(n - m)$ of the $\alpha$'s a value of zero and to the remaining $m$ $\alpha$'s values independently chosen from the Poisson distribution with mean $\mu = 1$ and censored at $X = 0$. An outline of the equivalence of these two procedures is as follows.
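The practical version of the Poisson resampling scheme can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, "censored at X = 0" is read as conditioning a Poisson(1) variable to be positive (which is what the moment generating function in Theorem 2.1 corresponds to), and Poisson(1) variates are generated by the standard multiplication method.

```python
import math
import random


def poisson1(rng):
    """Poisson variate with mean 1 via the multiplication method."""
    limit = math.exp(-1.0)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def zero_truncated_poisson1(rng):
    """Poisson(1) conditioned on being positive (simple rejection)."""
    while True:
        y = poisson1(rng)
        if y > 0:
            return y


def prs_multiplicities(n, m, rng):
    """Practical PRS: pick at random which m of the n units get a nonzero
    multiplicity, then draw those multiplicities independently."""
    alpha = [0] * n
    for i in rng.sample(range(n), m):
        alpha[i] = zero_truncated_poisson1(rng)
    return alpha


if __name__ == "__main__":
    alpha = prs_multiplicities(20, 13, random.Random(1))
    print(alpha, sum(alpha))
```

Here `alpha[i]` plays the role of $\alpha_i$ in (2.4), and `sum(alpha)` is the bootstrap sample size N of (2.5). The conceptual accept/reject definition gives the same distribution but wastes draws, since a full sample is accepted only when it has exactly m nonzero values.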


Theorem 2.1. The moment generating function $M_N(t)$ of $N$, the sample size of the Poisson resampling scheme, is given by

$$M_N(t) = \left[\frac{e^{(e^t - 1)} - e^{-1}}{1 - e^{-1}}\right]^m. \tag{2.6}$$


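Proof. Let $Y_1, Y_2, \dots, Y_n$ be $n$ Poisson variables with mean $\mu = 1$. Then it is easily seen that

$$P(N = w) = \text{const} \sum\nolimits_1 \left(\frac{1}{\alpha_1!\,\alpha_2! \cdots \alpha_m!}\right) \tag{2.7}$$

where the sum $\sum_1$ extends over all positive natural numbers $\alpha_1, \alpha_2, \dots, \alpha_m$ such that $\alpha_1 + \alpha_2 + \cdots + \alpha_m = w$. It then follows that (see Pathak (1961))

$$P(N = w) = \text{const}\,\frac{1}{w!}\left(m^w - \binom{m}{1}(m-1)^w + \cdots \pm \binom{m}{m-1}1^w\right) = \text{const}\,\frac{1}{w!}\left(\Delta^m x^w\big|_{x=0}\right) \tag{2.8}$$

where $\Delta$ is the difference operator with unit increment.

From (2.8) it follows that

$$P(N = w) = \frac{1}{(e-1)^m}\,\Delta^m \frac{x^w}{w!}\Big|_{x=0}. \tag{2.9}$$

Consequently the moment generating function $M_N(t)$ of $N$ is given by

$$M_N(t) = E(e^{tN}) = \sum_{w \ge 0} \frac{e^{tw}}{(e-1)^m}\,\Delta^m \frac{x^w}{w!}\Big|_{x=0} = \frac{1}{(e-1)^m}\,\Delta^m \sum_{w \ge 0} \frac{(xe^t)^w}{w!}\Big|_{x=0} = \frac{1}{(e-1)^m}\,\Delta^m e^{xe^t}\Big|_{x=0} \tag{2.10}$$

$$= \frac{1}{(e-1)^m}\left\{e^{me^t} - \binom{m}{1}e^{(m-1)e^t} + \binom{m}{2}e^{(m-2)e^t} - \cdots\right\}$$

$$= \frac{e^{me^t}}{(e-1)^m}\left\{1 - \binom{m}{1}e^{-e^t} + \binom{m}{2}e^{-2e^t} - \cdots\right\} = \frac{e^{me^t}}{(e-1)^m}\left(1 - e^{-e^t}\right)^m$$

$$= \left[\frac{e^{e^t} - 1}{e - 1}\right]^m = \left[\frac{e^{(e^t-1)} - e^{-1}}{1 - e^{-1}}\right]^m. \tag{2.11}$$

This completes the proof. □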
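As a numerical sanity check of the difference-operator form of $P(N = w)$ in (2.9), the sketch below compares it with the m-fold convolution of a Poisson(1) law conditioned to be positive. The function names and the small values m = 3, w up to 15 are illustrative assumptions, not part of the paper.

```python
import math
from math import comb, factorial


def pmf_difference_formula(w, m):
    """(e-1)^(-m) * Delta^m (x^w / w!) at x = 0, with the m-th forward
    difference written out as an alternating binomial sum."""
    delta = sum((-1) ** j * comb(m, j) * (m - j) ** w for j in range(m + 1))
    return delta / factorial(w) / (math.e - 1) ** m


def pmf_by_convolution(m, w_max):
    """Distribution of a sum of m independent zero-truncated Poisson(1)
    variables on {0, ..., w_max}, computed by direct convolution."""
    base = [0.0] + [1.0 / (factorial(j) * (math.e - 1)) for j in range(1, w_max + 1)]
    dist = [1.0] + [0.0] * w_max
    for _ in range(m):
        new = [0.0] * (w_max + 1)
        for a, pa in enumerate(dist):
            if pa == 0.0:
                continue
            for b in range(1, w_max - a + 1):
                new[a + b] += pa * base[b]
        dist = new
    return dist


if __name__ == "__main__":
    m, w_max = 3, 15
    dist = pmf_by_convolution(m, w_max)
    for w in range(w_max + 1):
        print(w, pmf_difference_formula(w, m), dist[w])
```

The two columns agree up to floating-point error, including the zeros for w < m, which is consistent with N being a sum of m independent positive Poisson-type variables.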


The preceding theorem shows that the distribution of $N$ can be viewed as that of the sum of $m$ IID random variables with a common distribution with the moment generating function given by the formula

$$m(t) = \frac{e^{(e^t-1)} - e^{-1}}{1 - e^{-1}}. \tag{2.12}$$

It is evident that $m(t)$ is the moment generating function of the Poisson distribution with $\mu = 1$ and censored at $X = 0$. Let $Y$ denote a random variable with moment generating function $m(t)$. Then $E(Y) = 1/(1 - e^{-1})$ and $V(Y) = e(e-2)/(e-1)^2$. Therefore

$$E(N) = mE(Y) = n + O(1) \tag{2.13}$$

and

$$V(N) = mV(Y) = n(e-2)/(e-1) + O(1). \tag{2.14}$$
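The moments quoted above can be checked numerically. The sketch below sums the series for a Poisson(1) variable conditioned to be positive and compares the result with the closed forms, then looks at $mE(Y) - n$ for a few n. The function names, the truncation at 60 terms, and the choice $m = [n(1 - e^{-1})] + 1$ are assumptions made for illustration.

```python
import math

P_POS = 1.0 - math.exp(-1.0)  # P(Y > 0) for a Poisson(1) variable


def zero_truncated_poisson_moments(terms=60):
    """Mean and variance of Poisson(1) conditioned on being positive,
    by direct summation of the (rapidly converging) series."""
    pmf = {j: math.exp(-1.0) / (math.factorial(j) * P_POS) for j in range(1, terms + 1)}
    mean = sum(j * pj for j, pj in pmf.items())
    second = sum(j * j * pj for j, pj in pmf.items())
    return mean, second - mean ** 2


def distinct_target(n):
    """Assumed choice m = floor(n * (1 - 1/e)) + 1, as in Section 2."""
    return math.floor(n * P_POS) + 1


if __name__ == "__main__":
    mean_y, var_y = zero_truncated_poisson_moments()
    print(mean_y, 1.0 / P_POS)
    print(var_y, math.e * (math.e - 2) / (math.e - 1) ** 2)
    for n in (10, 100, 1000):
        m = distinct_target(n)
        print(n, m * mean_y - n, m * var_y - n * (math.e - 2) / (math.e - 1))
```

Since $0 < m - n(1 - e^{-1}) \le 1$, the difference $mE(Y) - n$ stays between 0 and $1/(1 - e^{-1})$, which is the $O(1)$ term in (2.13).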


With  these  results,  we  now  proceed  to  establish  the  second  order  correctness  of  the 
sequential  bootstrap  based  on  the  Poisson  distribution. 


3.  Second  Order  Correction 


Let $\{a_{1,n}, \dots, a_{n,n}\}$ be a sequence of column vectors in $R^k$. In the application we typically use

$$a_{i,n} = (X_i - \bar X) \quad \text{with} \quad \bar X = \frac{1}{n}\sum_{i=1}^n X_i. \tag{3.1}$$

Note that $k$ denotes the dimension of $X_i$. Let $\{Y_j : j \ge 1\}$ be a sequence of non-negative i.i.d. random variables with a lattice distribution of span 1. Let

$$\mu = E(Y_1), \quad \sigma^2 = V(Y_1) > 0,$$

$$\gamma_3 = E(Y_1 - \mu)^3\sigma^{-3}, \quad p = P(Y_1 > 0), \quad q = 1 - p.$$


Further assume that the support of $Y_1$ has at least two non-zero values. Note that $\gamma_3 = 1$ if $Y_1$ has a Poisson distribution with mean $\mu = 1$. Let

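$$V_n = \frac{1}{n}\sum_{j=1}^n a_{j,n}a_{j,n}', \tag{3.2}$$

$$c_{j,n} = V_n^{-1/2}a_{j,n}, \tag{3.3}$$

$$P_n(x) = \frac{1}{6n}\sum_{i=1}^n \left((c_{i,n}'x)^3 - 3(c_{i,n}'\mathbf{1})^2(c_{i,n}'x)\right), \tag{3.4}$$

where $\mathbf{1}$ denotes the column vector in $R^k$ with all the entries equal to 1, and for any vector $t \in R^k$, let

$$d_n(t) = \frac{1}{n}\sum_{j=1}^n e^{it'a_{j,n}}. \tag{3.5}$$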

Define for any measurable function $h$ on $R^k$,

$$M_h = \sup_x |h(x)|(1 + \|x\|)^{-3}, \tag{3.6}$$

and for any $\delta > 0$, $x \in R^k$,

$$w(h, \delta; x) = \sup_{\|x - z\| \le \delta} |h(x) - h(z)|, \tag{3.7}$$

$$w(h, \delta) = \int_{R^k} w(h, \delta; x)\phi_k(x)\,dx, \tag{3.8}$$

where $\phi_k$ denotes the density of the $k$-variate standard normal distribution.


Further define

$$N = \sum_{i=1}^n Y_i, \quad \bar Y = N/n, \tag{3.9}$$

$$U_n = \frac{1}{\sigma\sqrt{n}}\sum_{j=1}^n c_{j,n}Y_j, \quad T_n = \sum_{i=1}^n I_{\{Y_i > 0\}}, \tag{3.10}$$

and

$$\psi_n(x) = \left(1 + \frac{\gamma_3}{\sqrt{n}}P_n(x)\right)\phi_k(x). \tag{3.11}$$

Let $F^\circ(\cdot|m)$ denote the conditional distribution of

$$\frac{\sqrt{n}\,\mu}{\sigma N}\sum_{i=1}^n c_{i,n}Y_i \tag{3.12}$$

given $(T_n = m)$. We now state the main theorem.

Theorem 3.1. Let $\sum_{j=1}^n a_{j,n} = 0$, $E(Y_1^6) < \infty$ and for some $M > 0$

$$\sum_{j=1}^n \|a_{j,n}\|^3 \le Mn. \tag{3.13}$$

Suppose for any $0 < K < L < \infty$, there exists a $\gamma = \gamma(K, L) < 1$ such that

$$\limsup_{n \to \infty}\ \sup_{K \le \|t\| \le L} |d_n(t)| \le \gamma. \tag{3.14}$$

If $m - np$ is bounded and if $h$ is a real valued measurable function on $R^k$ with $M_h < \infty$, then

$$\left|\int_{R^k} h(x)\left(F^\circ(dx|m) - \psi_n(x)\,dx\right)\right| = o(M_h n^{-1/2}) + O(w(h, \delta_n)), \tag{3.15}$$

for some $\delta_n = o(n^{-1/2})$.

Proof of Theorem 3.1 is deferred to the Appendix.

In order to apply Theorem 3.1 to the sequential resampling procedures, let $X_1, X_2, \dots$ be IID random vectors in $R^k$ with mean vector $\eta$ and dispersion $\Sigma$. Let $H$ be a three times continuously differentiable function in a neighborhood of $\eta$ and let $l(x)$ denote the vector of first order partial derivatives (gradient) of $H$. Suppose that the distribution of $X_1$ is strongly non-lattice, and $E\|X_1\|^3 < \infty$. Let $l(\eta) \ne 0$, $\theta^2 = l'(\eta)\Sigma l(\eta)$ and $\theta_n^2 = l'(\bar X)\Sigma_n l(\bar X)$, where $\Sigma_n$ is the sample dispersion. By taking $a_{i,n} = (X_i - \bar X)$, we can apply Theorem 3.1 to arrive at


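Corollary 3.1. If $\gamma_3 = 1$, $\mu = \sigma$, $E(Y_1^6) < \infty$ and if $m - np$ is bounded, then

$$\sup_a \sqrt{n}\,\Big|P\Big(\sqrt{n}\Big(H\Big(\frac{1}{N}\sum_{i=1}^n Y_iX_i\Big) - H(\bar X)\Big) \le a\theta_n \,\Big|\, T_n = m,\ X_1, \dots, X_n\Big) - P\big(\sqrt{n}(H(\bar X) - H(\eta)) \le a\theta\big)\Big| \to 0 \tag{3.16}$$

as $n \to \infty$, for almost all sample sequences $\{X_i\}$.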


Corollary 3.2. Suppose the function $H$ on $R^k$ is three times continuously differentiable in a neighborhood of the origin and $H(0) = 0$. If $\gamma_3 = 1$, $E(Y_1^6) < \infty$, and $m - np$ is bounded, then

supv^l P(V^n(^Yj{Xi-X)Y^  <  ay/UmM\Tn  =  m,  Xu...  ,Xn) 

—P{yfnH{X  -7)  <  aV//'(0)E/(0))|  -+  0  (3.17) 

as $n \to \infty$, for almost all sample sequences $\{X_i\}$.


Corollary 3.2 does not assume any relation between $\mu$ and $\sigma$, so it is applicable for a wide range of distributions for $Y_1$. In particular if $Y_1$ is negative binomial with parameters $r > 5$, and $P(Y_1 = 0) = (2 - (r/2) + (1/2)\sqrt{r(r-4)})^r$, then $\gamma_3 = 1$, $\mu = -r + 2r(4 - r + \sqrt{r^2 - 4r})^{-1}$ and $\sigma^2 = r/(r-4)$.
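The negative binomial example can be verified numerically. The sketch below assumes the standard parametrization in which $Y_1$ counts failures before the $r$-th success with success probability $p_0$, so that $P(Y_1 = 0) = p_0^r$ and the skewness is $(2 - p_0)/\sqrt{r(1 - p_0)}$; the function name and the tested values of r are illustrative.

```python
import math


def negative_binomial_summary(r):
    """Success probability p0 read off from P(Y = 0) = p0**r in the text,
    followed by mean, variance and skewness of the negative binomial
    (number of failures before the r-th success)."""
    p0 = 2 - r / 2 + 0.5 * math.sqrt(r * (r - 4))
    mean = r * (1 - p0) / p0
    var = r * (1 - p0) / p0 ** 2
    skew = (2 - p0) / math.sqrt(r * (1 - p0))
    return p0, mean, var, skew


if __name__ == "__main__":
    for r in (6, 10, 50):
        p0, mean, var, skew = negative_binomial_summary(r)
        mean_text = -r + 2 * r / (4 - r + math.sqrt(r * r - 4 * r))
        print(r, p0, skew, mean - mean_text, var - r / (r - 4))
```

For each r the skewness equals 1 and the mean and variance match the expressions in the text up to rounding, since $p_0$ solves $(2 - p_0)^2 = r(1 - p_0)$.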


Proof of Corollary 3.2 is omitted as it is similar to the proof of Corollary 3.1.

Proof of Corollary 3.1. By expanding in Taylor series, we have

=  0  +  -C^UXLMU*  +  0((logn)3n-')  (3.18) 

Y  \JnYl 

on $\|U_n\| \le \log n$ and $|\bar Y - \mu| \le \mu/2$.

Since the distribution of $X_1$ is strongly non-lattice, and $E\|X_1\|^3$ is assumed to be finite, conditions (3.13) and (3.14) hold for almost all sample sequences $\{X_i\}$. By Theorem 3.1 and by Lemma 3 of Babu and Singh (1984), we have

$$P\Big(\sqrt{n}\Big(H\Big(\frac{1}{N}\sum_{i=1}^n Y_iX_i\Big) - H(\bar X)\Big) \le y\theta_n \,\Big|\, T_n = m,\ X_1, \dots, X_n\Big) = \int_{-\infty}^{y}\Big(1 + \frac{1}{\sqrt{n}}\gamma_3 q_0(x)\Big)\phi_1(x)\,dx + o(n^{-1/2}) \tag{3.19}$$

uniformly in $y$ for almost all sample sequences, where $q_0$ is a polynomial. Similarly from the proofs of Theorems 20.8 and 24.2 of Bhattacharya and Ranga Rao (1986),

$$P\big(\sqrt{n}(H(\bar X) - H(\eta)) \le a\theta\big) = \int_{-\infty}^{a}\Big(1 + \frac{1}{\sqrt{n}}q_0(x)\Big)\phi_1(x)\,dx + o(n^{-1/2}) \tag{3.20}$$

uniformly in $a$. The result now follows from (3.19) and (3.20) as $\gamma_3 = 1$.

The  most  commonly  used  statistics,  especially  the  studentized  versions,  are  of  the 
type 

(„  =  MH(X)  -  !>(*)),  (3-21) 

where $A$ is a function from $R^k$ to $R^r$ and $v$ is a smooth real-valued function on $R^r$. The classical Student's $t$ is an example of this type of statistic. If $X_i$ are univariate, then

$$t_n = \sqrt{n}(\bar X - E(X_1))/s_n$$

satisfies (3.21) with $H(x) = x$, $A(x) = (x^2, x)$, $v(x, y) = \max(0, (x - y^2))^{1/2}$ and $s_n^2 = n^{-1}\sum_{i=1}^n (X_i - \bar X)^2$. The version corresponding to (3.21) under the Poisson scheme is generally of the type,

u(Y)  =  !>*)*). 


(3.22) 
Corollary 3.3. Suppose $Y_1$ is as in Corollary 3.1. Let

HE(  A(X,)))  =  (3.23) 

v(X)  =  yJl'(X)SJ(X),  (3.24) 

and let $L(X_i)$ be a linearly independent sub collection of $(X_i, A(X_i))$ with the property that all the coordinates of $(X_i, A(X_i))$ can be expressed as linear combinations of those of $L(X_i)$. If the distribution of $L(X_1)$ is strongly non-lattice, $E\|L(X_1)\|^3 < \infty$, and if $m - np$ is bounded then

$$\sup_a \sqrt{n}\,\big|P(t_n(Y) \le a \mid T_n = m,\ X_1, \dots, X_n) - P(t_n \le a)\big| \to 0, \tag{3.25}$$

as $n \to \infty$, for almost all sample sequences $\{X_i\}$.

Remark 1. If $Y_1$ is a Poisson random variable with mean 1, then $\mu = \sigma = \gamma_3 = 1$. If $m = [n(1 - e^{-1})] + 1$, then $0 < m - np \le 1$. So Corollaries 3.1 and 3.3 are applicable for the basic Poisson scheme described in Section 2.

4.  Appendix 

To establish Theorem 3.1, we need some preliminary results. Let

$$F_n(x, r, m) = P(U_n \le x,\ N = r,\ T_n = m), \tag{4.1}$$

where $U_n \le x$ means that the inequalities hold coordinatewise. Let

$$Z = (Y_1 - \mu,\ I_{\{Y_1 > 0\}} - p)',$$

and

$$\psi_n(x, y) = \Big(1 + \frac{1}{\sqrt{n}}Q_n(x, y)\Big)\phi_k(x)\phi_0(y), \tag{4.2}$$

where $\phi_0$ denotes the density of the bivariate normal distribution with zero mean vector and dispersion

$$\Sigma_0 = E(ZZ') = \begin{pmatrix} \sigma^2 & \mu q \\ \mu q & pq \end{pmatrix}. \tag{4.3}$$

Suppose

Qn(x,y )  =  73  Pn{x)  + 

-  ^(E({Z^oly)(Yi  -  v)2P<l  +  ^(<1  -  P)I{Yi> 0}))/detEo) 

+  \(M2  -  (4-4) 

Since the support of $Y_1$ is assumed to include at least two non-zero values, it follows that $\Sigma_0$ is positive definite.


Proposition  4.1. 

Under  the  conditions  of  Theorem  3.1, 

/  h(x)(nFn(dx,  r,  m)  -  ^(x,  yr, um)dx) 

J 

Rk 

=  (o(Mhn~l/2)  -1-  0(w(fi,5n)))v7o(yn,wm) 

1  1=1 

(4.5) 

uniformly in

$$y_r = (r - n\mu)n^{-1/2}, \quad \omega_m = (m - np)n^{-1/2}, \tag{4.6}$$

for some $\delta_n = o(n^{-1/2})$.

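Remark 2. Under the conditions of Theorem 3.1,

$$\sup_{1 \le j \le n} \|a_{j,n}\| = O(n^{1/3}), \tag{4.7}$$

so $\sup_{1 \le j \le n} \|a_{j,n}\|n^{-1/2} \to 0$ as $n \to \infty$. Consequently

$$\sup_{1 \le j \le n} E\big(Y_1^3 I_{\{Y_1\|a_{j,n}\| > \sqrt{n}\}}\big) \le \sup_{1 \le j \le n} \|a_{j,n}\|^3 n^{-3/2} E\big(Y_1^6 I_{\{Y_1\|a_{j,n}\| > \sqrt{n}\}}\big) = o(n^{-1/2}).$$

Hence the last term in (4.5) can be replaced by $o(M_h n^{-1})$.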

The proof of Proposition 4.1 is similar to that of Theorem 1 of Babu and Bai (1996). It uses truncation arguments in the proof of Theorem 20.8 of Bhattacharya and Ranga Rao (1986). The Proposition follows from Lemmas 4.1, 4.2 and 4.3 that are stated below, and Lemma 4 of Babu and Bai (1996). Lemmas 4.1, 4.2 and 4.3 are modified versions of the first three lemmas of Babu and Bai (1996). The measure $J$ in Babu and Bai (1996) is assumed to satisfy $\int \|x\|^{k+14}\,dJ(x) < \infty$.

Before stating the lemmas, we introduce some notation. For a non-negative integral vector $\alpha' = (\alpha_1, \dots, \alpha_s)$ and $z' = (z_1, \dots, z_s) \in R^s$, we write

$$|\alpha| = \alpha_1 + \cdots + \alpha_s, \quad z^\alpha = z_1^{\alpha_1} \cdots z_s^{\alpha_s} \quad \text{and} \quad D^\alpha = D_1^{\alpha_1} \cdots D_s^{\alpha_s},$$

where $D_j$ denotes the partial derivative with respect to the $j$-th coordinate.

Lemma  4.1.  Let  g  be  a  real  valued  function  on  Rk  x  satisfying 

^2  /(i +  lla:ll)3+fc+1lfl,(^,^)M;r  <  °°> 

rn^ZJ  Rk 

for  some  non-negative  s.  Then  there  exists  a  constant  c(k)  depending  only  on  k  such 
that,  for  all  m  G  Z3 , 

/(!  +  ||x||*)|fl'(®,m)|da; 

Rk 

G  Rk 

where  G  =  [— 7T,  tt]j  and  g  denotes  the  Fourier  transform  of  g. 


Proof  of  Lemma  4.1  is  similar  to  that  of  Lemma  1  of  Babu  and  Bai  (1996)  and 
hence  omitted. 

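Lemma 4.2. Let $Y^\circ$ and $\varepsilon^\circ$ have the same distributions as that of $Y$ and $\varepsilon$ respectively. Suppose $(Y, \varepsilon)$ and $(Y^\circ, \varepsilon^\circ)$ are independent. Then

$$\frac{1}{n}\sum_{j=1}^n \big|E(e^{it'a_{j,n}Y + iv'\varepsilon})\big|^2 \le E|d_n(t(Y - Y^\circ))|. \tag{4.8}$$


Proof of Lemma 4.2. The left side of (4.8) is the same as

$$\frac{1}{n}\sum_{j=1}^n E\big(e^{it'a_{j,n}(Y - Y^\circ)}e^{iv'(\varepsilon - \varepsilon^\circ)}\big) = E\big(d_n(t(Y - Y^\circ))e^{iv'(\varepsilon - \varepsilon^\circ)}\big) \le E|d_n(t(Y - Y^\circ))|.$$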
Lemma 4.3. Let $E(Y_1^3) \le M_1$ and let the smallest eigenvalue of $\Sigma_0$ be bounded below by $\sigma_1 > 0$. Suppose (3.13) holds. Then for $\|t\| \le n^{-1/2}\log n$ and $Mn^{-1/2}\log n \le \|v\| \le \pi$, we have for any $C \subset \{1, \dots, n\}$,


JJ  E(eit,aj’nYl+i,/'z) 


jec 


<  ki  exp(6  —  fc2(logn)2) 


where $b$ denotes the number of integers in $C$ and $k_1, k_2 > 0$ are positive constants depending only on $M_1$ and $\sigma_1$.


Proof of Lemma 4.3. Since

le-i^-i»rE{ei»Yl+iui{Yl>0})  _  !  +  I(w,„)So(o;,n)/|  <  u)\\3Mu 

there  exists  a0<5<7r/8  and  Ai  >  0  depending  only  on  (j\  and  M\  such  that 

i  <  |£(e,'("'“>z)|  <  1  -  A i(u2  +  u2), 

whenever  |w|  <  45  and  |u|  <  45.  Suppose  ||f||  <  n-1^2logn,  and  Mn-1/2  log  n  < 
j|i/||  <  5.  Then  for  some  A  >  0 


eit'ai'„Yi+iv'Z 


)  <  Ilf1  -  A«‘VP  +  IMP) 

i= i 


C  g— An||PVn||2— nA||«/||2 


<  g— AAf(logn)2 


(4.9) 


Further  observe  that 


|£(e,w1+,vjm>0})|  =  \q  +  pe^E{eiu,Yl\Yi  >  0)| 

<q  +  p\E{jwY'\Yi  >0)1 

<l-7o<l, 

for  some  70  >  0,  whenever  |  <  |w|  <  n  +  5.  Hence 

|£(e‘<S>n+«V*)|  <  1  _  7o, 

whenever $\|t\| \le n^{-1/2}\log n$ and $\delta \le \|v\| \le \pi$. This completes the proof of Lemma 4.3.

Proof of Theorem 3.1. Let $H_n$ denote the indicator function of $\big(2\big|n^{-1}\sum_{j=1}^n Y_j - \mu\big| > \mu\big)$. Suppose $m - np$ is bounded. By Markov inequality, $E(H_n) = O(n^{-3})$. Hence

$$E(H_n|T_n = m) \le E(H_n)/P(T_n = m) = O(n^{-3})(P(T_n = m))^{-1} = O(n^{-5/2}).$$
Clearly

$$E(Y_1H_nN^{-1}|T_n = m) = \frac{1}{n}\sum_{i=1}^n E(Y_iH_nN^{-1}|T_n = m) = \frac{1}{n}E(H_n|T_n = m) = O(n^{-7/2}).$$

Consequently,  there  exists  a  constant  M2  >  0,  such  that 

n 

M2E(h(y/np^2cjtnYj/(Na))\Tn  =  m) 

3= 1 

n 

<  E{H,\Tn  =  m)  +  n3'!£(|| 2]ci,„V;iV-1H„||3|T„  =  m) 

j=l 

=  0(n-T/2)  +  n3'JE(£  =  m) 

J  =  1 

=  0(n“7/2)  +  0(n3/2£(/fn|rn  =  m))| 

=  0(n"1).  (4.10) 

By (3.12), (4.1) and (4.6), we have

$$F^\circ(x|m) = \sum_{j=1}^{\infty} F_n\big(x(1 + (y_j/\mu\sqrt{n})),\ y_j,\ \omega_m\big)/P(T_n = m),$$

and by Proposition 4.1,

1-7=  E  -  (yj/vVn)) 

x  ( nFn(dx,yj,um )  -  ^n(x,yj,u3m)dx\ 

=  (o(M/ji-1/2)  +  0MMn)))4=  E  <^o(yi,o;m)  +  o(M^n-1/2) 

Vn 


=  o(Mhti  I,2)  +  0 (ui(h,6„)). 


(4.11) 
By Theorem 13 on local Edgeworth expansions on pages 205-206 of Petrov (1975),
we  have 

'/pv‘P(Tn  =  «)  =  *KM5)(  i  +  ^=((^)3  -  %)) 

(4.12) 

The  Theorem  now  follows  from  (4.11)  and  (4.12),  as  in  the  proof  of  Theorem  6  of 
Babu  and  Bai  (1996). 